AN ALGORITHM FOR RANDOM SIGNED 3-SAT WITH INTERVALS 
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Abstract. In signed fe-SAT problems, one fixes a set M and a set S of subsets of M, and is 
given a formula consisting of a disjunction of m clauses, each of which is a conjunction of k 
literals. Each literal is of the form "x G S", where S £ 5, and x is one of n variables. 
For Interval-SAT (iSAT), M is an ordered set and S the set of intervals in M. 
We propose an algorithm for 3-iSAT, and analyze it on uniformly random formulas. The al- 
gorithm follows the Unit Clause paradigm, enhanced by a (very limited) backtracking option. 
Using Wormald's ODE method, we prove that, if m/n < 2.3, with high probability, our algo- 
rithm succeeds in finding an assignment of values to the variables satisfying the formula. 



1. Introduction 

Let M be a (usually finite) set, S a set of subsets of M, and k a positive integer. For the 
signed k-satisfiability problem, or signed k-SAT, one is given as input a finite set of variables 
X and a formula in signed conjunctive normal form (CNF). This means that there is a list of 
clauses, each of which is a conjunction (A) of (signed) literals of the form x G S where x is a 
variable in X and the "sign" S is a set in S. The question is then whether there exists a satis- 
fying interpretation, i.e., an assignment of values to the variables such that each of the clauses 
is satisfied. This setting includes as a special case the classical satisfiability (SAT) problem. 
There, one chooses for M the 2-element set {True, False} and S = {{True}, {False}}. 

In case M is an ordered set (a chain) and the set S is the set of all intervals in M, we speak 
of Interval SAT, or iSAT. In our contribution, we set M := [0, 1], because this includes all 
iSAT settings with finite M. It is related to the study of random interval graphs [38, 32]. As 
is usually done in the context of SAT, we speak of /c-iSAT, if each clause comprises k Uterals. 
Our notation and terminology on signed SAT follows [19]. 

Signed SAT problems originated in the area of so-called multi- valued logic [35], where 
variables can take a (usually finite) number of so-called truth values, not just True or False. 
Work on signed CNF formulas started in earnest with the work of Hahnle and Manya and their 
coauthors. We refer the reader to the survey paper [ 10], and the references therein. 

The motivation for studying signed formulas was to extend algorithmic techniques devel- 
oped for deductive systems in multi-valued logic to better cover practical applications [29]. 
Indeed, on the one hand, a number of papers show how combinatorial problems can be solved 
using signed SAT algorithms [13, 14, 11,26, 15]; on the other hand, a large number of heuristic 
and exact algorithms have been studied (see [5, 16] and the references therein), and a number of 
polynomially solvable subclasses of signed SAT have been identified [23, 10, 36, 9, 6, 5, 19]. 
While in the works of Manya and his collaborators, order-theoretic propeities of the ground 
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set M are exploited to make conclusions on the complexity of signed SAT, Chepoi et al. [19] 
completely settle the complexity question in the general case by reverting to combinatorial 
properties of the set system S. In particular-, they prove that: signed /c-SAT, A; > 3, is poly- 
nomial, if, and only if, HsgS ^ ^ ^^'^ NP-complete otherwise; signed 2-SAT is polynomial 
if, and only if, S has the Helly-propeity (if no two sets in a subfamily are disjoint, then the 
subfamily has non-empty intersection), and NP-complete otherwise. 

For the case when S has the Helly-property, Chepoi et al. give a non-satisfiability certificate 
for signed 2-SAT in the spirit of Aspvall, Plass, and Tarjan's famous result for classical 2- 
SAT [7]. 

Most applications and a great deal of the earlier complexity results [10] focus on regular 
signed SAT, where M is a poset, and the formulas may only involve sets of the form S = {j \ 
j >i} ox S = {j \ j <i}. Regular iSAT (or just regular SAT) is regular signed SAT for posets 
M which are chains. 

In particular, for regular iSAT, random formulas have been investigated from a heuristic 
point of view. Manya et al. [37] study uniformly generated random regular 3-iSAT instances, 
and observe a phase transition similar to that observed in classical SAT (see [3] and the ref- 
erences therein): (i) the most computationally difficult instances tend to be found near the 
threshold, (ii) there is a shaip transition from satisfiable to unsatisfiable instances at the thresh- 
old and (iii) the value of the threshold increases as the number of truth values considered 
increases. Their results are confirmed and extended by further papers exploring uniformly 
random regular 3-iSAT instances [12, 10, 15]. 

Further, in [12, 15] a bound on the ratio ™/n is given, beyond which a random formula is 
with high probability (whp) unsatisfiable. To our knowledge, however, ours is the first rigorous 
analysis of an algorithm for random signed SAT. 

Our interest in the particular version of signed SAT arises from applications in computational 
systems biology, where iSAT yields a generalization of modeling with Boolean networks [33], 
where biological systems are represented by logical formulas with variables conespond to 
biological components like proteins. Reactions are modeled as logical conditions which have to 
hold simultaneously, and then transfened into CNF. The model is widely used by practitioners 
(see e.g. [22, 34, 30] and the references therein). 

Often, though, this binary approach is not sufficient to model real life behavior or even 
accommodate all known data. Due to new measurement techniques, a typical situation is that 
an experiment yields several "activation levels" of a component. Thus, one wants to make 
statements of the form: If the quantity of component A reaches a certain threshold but does 
not exceed another, and component B occurs in sufficient quantity, then another component C 
is in a certain frame of activation levels. The collection of such rules accurately models the 
global behavior of the system. We refer to [8] for details of models and applications. 

In this paper we present and analyze an algorithm which solves uniformly random 3-iSAT 
instances with high probability, provided that the ratio between the number m of clauses and 
the number n of variables is at most 2.3. Our algorithm is an adaption of the well-known Unit 
Clause algorithm from classical SAT [17, 2], where, in an inner loop, 1 -clauses are treated 
if any exist, and in an outer loop, a variable is chosen freely and assigned some value. This 
Unit Clause approach is enhanced with a "backtracking" subroutine not very much unlike the 
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one studied in [25] for classical 3-SAT, where the currently best known algorithm for random 
classical /c-SAT, /c = 3, is given (but see [21] for general k). 

In the analysis, we use Wormald's differential equations method [39]. ODE methods have 
been used for the analysis of algorithms for classical SAT with great success [17, 18, 25, 1,4]. 
In our analysis, we combine the idea of Achlioptas and Sorkin [4] to consider as a time step 
an iteration of the outer loop, but we use Wormald's theorem [40] where they use a Markov- 
chain based approach. The analysis of the inner loop requires to study the first busy period of 
a certain stable server system [1,2], or, in our case, more accurately, the total poplulation size 
in a type of branching process. 

Moreover, it is important to point out that we prove that our algorithm succeeds with high 
probability, unlike [1, 4], where the aim' is only to prove success with positive probability 
(wpp). To obtain a whp result, the "backtracking" part is essential", and its analysis has to be 
accommodated for, of course, in the analysis of the . 

The value 2.3 arises from the numerical solution to an initial value problem (IVP). Extending 
the results for A;-iSAT for > 4 is conceptually easy; we briefly discuss it in the conclusions. 

The outline of the paper is as follows: In the next section, we present our algorithm for random 
3-iSAT in detail. In Section 3, we prove some facts about uniformly at random chosen sub- 
intervals of [0, 1]. Section 4 we take a brief excursion to random 2-iSAT as our algorithm for 
3-iSAT ultimately relies on solving a 2-iSAT instance. In Section 5, we compile the required 
facts about total population sizes of a kind of branching system, which are then applied in 
Section 6 to the study of the inner loop of our algorithm. Finally, in Section 7, we prove the 
whp-result for our algorithm. We raise some issues for future research in the final section. 
Several technical arguments have been moved into the appendix. 

Throughout the paper, we hide absolute constants in the big-O-notation. If the constant de- 
pends on other parameters, we make this clear by adding an index, e.g., Oe(-)- As customary, 
we use the abbreviation iid for "independent and identically distributed" and uar for "uniformly 
at random". Whp and wpp are to be understood for n — > oo, with m = m{n) depending on n. 

2. An algorithm for random 3-iSAT 

In this section, we describe an algorithm which finds a satisfying interpretation if the number 
of clauses is m = cn with c < 2.3. 

2.1. The random model; exposure. For our random model, we choose a formula uar from 
the set of all possible formulas on n variables with m 3-clauses, each containing three distinct 
variables. As is customary in the context of random SAT, we use the language of "exposing" 
literals. Intuitively, the idea is that the information about each Uteral is written on a card which 
lies face down, until the information is exposed. Clearly, the unexposed part of the formula 
is uar conditioned on which literals have been exposed and which have not. We refer to the 
elegant description in Achlioptas paper [2]. 



This was an artifact of the hunt for the precise value of the threshold in random fc-SAT, for which, by Friedgut's 
famous result [24], wpp statements suffice. 

"Cf. for example, [25], where the range in which the algorithm succeeds increases dramatically, once a back- 
tracking part is added. 
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2.2. Brief description of the algorithm. The basic framework of our Algorithm is the same 
as for most algorithms for classical fc-SAT. A fomierly unused variable is selected, and a value 
is assigned to it. Then, clauses containing the variable aie updated: if the literal of the clause 
involving the variable is satisfied, the clause is deleted; otherwise the literal is deleted from the 
clause, leaving a shorter clause. The variable is removed from the set of unused variables, and 
declared a used variable. The algorithm fails if, and only if, it creates an empty clause. 

However, to a certain extent, our algorithm is able to repair bad choices it has made. Thus, it 
occasionally only assigns tentative values to variables. As long as it is not certain that a variable 
keeps its tentative value, no deletions of clauses or literals from clauses are performed. Instead, 
we assign colors to the clauses, which code the number of satisfied, unsatisfied, and unexposed 
literals they contain. The meaning of the colors will be explained in Table 1 but at this point 
it suffices to know that red clauses correspond to unexposed 1 -clauses, i.e., clauses with one 
unexposed literal and the variables in any other literal of the clause have tentative values which 
render the literals false. 

As said before, the basic approach is that of the Unit-Clause algorithm. The outer loop of 
the algorithm will maintain the property that there is no 1-clause. In each iteration of the outer 
loop, a variable is selected uar from the set of unused variables. Such a variable selected in 
the outer loop is refeixed to as a free variable. The inner loop is initialized by assigning a 
tentative value to this free variable, and then repeats as long as there are red clauses. In each 
iteration of the inner loop, a red clause is selected and serviced: the variable contained in the 
clause (the current variable of the iteration) is tentatively set to some value in such a manner 
that the serviced red clause becomes true. We refer to the variables selected in the inner loop 
as constrained variables. 

If, during a run of the inner loop, a situation is reached in which it is probable that an empty 
clause will be created, it backtracks. This happens when the following /a?aZ/fy is suffered: The 
current variable occurs in another red clause, other than the one serviced. If that happens, there 
is a 1/3 probability that the two intervals occurring in the two red clauses are disjoint [38], so 
that creating an empty clause is inevitable. 

For this situation, the inner loop maintains a rooted tree G of decisions it has taken so far. 
The nodes of the tree correspond to variables to which tentative values have been assigned, 
with the root being the free variable with which the run of the inner loop was initialized. The 
edges correspond to clauses. The fatality entails that a circle is closed. If that happens the 
values of the variables along the paths from the root to serviced literal are changed. Then, all 
other tentative values are made permanent, and the inner loop is restarted with the new formula, 
but this time without a free variable in the initialization. We refer as Phase I to the run of the 
inner loop before a repair occurs (or if no repair occurs), and as Phase II to the run of the inner 
loop after a repair has been performed. In Phase II, no further repair is attempted. Instead, 
if fatalities occur, the inner loop takes a Zen approach, and just moves on (it "goes Zen"). In 
Phase I, if a fatality occurs, there's the possibility that a repair is not possible. In this case, too, 
the inner loop "goes Zen". 

After all red clauses have been dealt with in either Phase I or Phase II, the tentative values are 
made permanent, and control is returned to the outer loop, which selects another free variable, 
and so on. 
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The outer loop terminates, if the number of 2-cIauses plus the number of 3-clauses drops 
below a certain factor c' of the number of unused variables. Then, it deletes an arbitrary literal 
from every 3-clause and invokes the exact polynomial algorithm by Chepoi et al. [19] to decide 
whether the resulting 2-iSAT formula has a satisfying interpretation. We will prove in Section 4 
that this is always the case if the ratio of the number of resulting 2-clauses over the number of 
unused variables is below |. 

The complete algorithm is shown below as Algorithm 1 (the outer loop). Algorithm 2 (the 
inner loop), and Algorithm 3 (the repair procedure). Throughout the course of the algorithm, 
for i = 0,1, 2, 3, we denote by Yi{t) the number of i-clauses, and by X{t) the number of 
unused variables, respectively, at the beginning of iteration t of the outer loop. Moreover, for 
an interval I, we denote by 

x{I) := argmin^.gj' |x — V2[ (1) 
the point in / which is closest to 1/2. We refer to the variable Xj which is selected in iteration j 
of the inner loop as the current variable of that iteration. 
Below, we will prove the following fact. 

Lemma 1. A single run of Algorithm 2 (including a possible repair and consequent Phase II) 
produces an empty clause, only if it "goes Zen". 

The performance of the algorithm on random 3-iSAT instances is analyzed in Sections 6 
and 7. There, we will prove the following theorem. 

Theorem 2. Let c := 2.3, and suppose Algorithm 1 is applied to a uniformly random iSAT 
formula on n variables with m 3-clauses. Ifm < cn, then, whp. Algorithm I creates no empty 
clause, i.e., it finds a satisfying interpretation. 

The value 2.3 is determined through the numerical solution of an initial value problem. It 
corresponds to the point in which the increase in red clauses in each iteration of the inner loop 
would become so large that the inner loop will not terminate. 

Algorithm 1 UC w/ backtracking (outer loop) 

(o-l) Given: 3-CNF-formula; positive constant c'. 
(o-2) t:=0 

(o-3) While ¥2(1) + Ysit) > c'X{t): 

(0-3.1) Choose a variable x uar. 

(0-3.2) Invoke Inner loop (Phase I). 

(0-3.3) t:=t + l 
(o-4) In every 3-clause, remove one literal at random. 

(o-5) Invoke Chepoi et al.'s algorithm (cf. Section 4) for the remaining 2-iSAT formula. 



2.3. Comparison to algorithms for classical SAT. For classical SAT, if a variable x is set to a 
value, the probability that a random literal containing x evaluates to true is 1/2 — independently 
of the value. As will become apparent in the next section, this is far from true for random 
interval literals. There, the value 1/2 is the single, most likely value to be contained in a random 
interval (the probability is 1/2) and all other values are less likely. Hence, we will assign 1/2 to 
the variables as long as possible which is for aU free variables. 
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Algorithm 2 Inner loop 
(i-1) Given: 

• In Phase I: formula consisting of 2- and 3-clauses only; a (free) variable xq. 

• In Phase II: fonnula consisting of 1-, 2- and 3-clauses. 
(i-2) j := 

(i-3) Initialize: Expose the occurrences of xq in all clauses. 

• In Phase I only: 

(i-3.1) Tentatively set xq to 1/2. 

(i-3.2) Initialize the graph G := ({xq}, 0). 

• In Phase II only: 

(i-3.1) Color all 1-clauses red. 
(i-4) Expose the intervals associated with xq. Color clauses containing xq according to Tab. 1. 
(i-5) J := i + 1 

(i-6) If there is no red clause, exit inner loop: Set all variables to their tentative values; remove 
satisfied clauses and remove violated literals from their clauses; return to outer loop. 

(i-7) Select a red clause Cj at random; let Lj be the unexposed literal in Cj; expose current 
variable Xj of Lj 

(i-8) Expose all occurrences of Xj in colored clauses. 

(i-9) If Xj is contained in a red clause other than Cj : 

• In Phase I only: 

(i-9. 1) If there is a red, blue, or black 3-clause: go Zen! 

(i-9. 2) If the graph G contains a cycle, or x^ is in a blue clause: go Zen! 

(i-9. 3) If Xj occurs in three or more red clauses (including Cj): go Zen! 

(i-9.4) Otherwise: Phase I is completed. Let c' be the unique red clause different 

from Cj containing xj in a literal l' = xj G j'. Repair the unique path 

between xq and Cj ; then initiate Phase II. 

• In Phase II only: Go Zen! 

(i-10) Expose all occurrences of Xj in all uncolored clauses. 

(i-11) For every uncolored 2-clause Xj £ I V y £ J containing Xj, add to G the vertex y and 

the edge x^ G / V y € J between x^ and y. 
(i-12) Tentatively set x^ to 

(i-13) Update the colors of all clauses containing Xj. 
(i-14) Goto step (i-5). 

Algorithm 3 Repair path 

(r-1) Given: Set of colored 1-, 2- and 3-clauses; a literal l' = x^ G j'; a path of the form 

XQ, xoGJoVxiGIi, xi G Ji V X2 G I2, Xfc_i G Jfc_i V Xfe G Ife; 

(r-2) Fori = 0, . . . , A; - 1: 

(r-2.1) Set Xj (permanently) to x{jj) 
(r-3) Set xfc (permanently) to x( j') 

(r-4) Set all variables from Phase I, except those which have just been set in (r-2) and (r-3), to 
their tentative values; remove satisfied clauses and remove violated literals from their 
clauses. 
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Color 



Meaning 



Uncolored 

Black 

Red 



Blue 



Pink 



Turquoise 



All literals in the clause are unexposed. 
All literals are exposed. 

The clause has precisely one unexposed literal. The tentative values of any other 
variables in the clause make the coixesponding literals false. In particular, unex- 
posed 1 -clauses are red. 

The clause contains precisely one unexposed literal and at least one exposed 
literal which evaluates to true for the tentative value of its variable. 
The clause is a 3-clause, precisely one of its literals is exposed, and this literal 
evaluates to false for the tentative value of its variable. 

The clause is a 3-clause, precisely one of its literals is exposed, and this literal 
evaluates to true for the tentative value of its variable. 



Table 1. Semantics of the colors of the clauses. 



The rationale behind assigning the value 1/2 to free variables is two-fold. Firstly, it makes 
the analysis a lot more easy than if one tries to find a maximum cardinality subset of literals 
containing x all of whose intei-vals have paiwise non-empty intersection. Secondly, for large 
numbers of literals containing x, the maximum cardinality of a subset with pairwise intersect- 
ing intervals is asymptotically attained by taking all literals with intervals containing 1/2 (this 
is Theorem 4.7 of Scheinennan's paper [38]). 

The situation for consti'ained variables is similar-, but a bit more complicated. For con- 
strained variables, we are free only to choose the value for the variable within the interval I 
for the literal L = x G I which we wish to satisfy. Unlike to classical SAT, where this does 
not change the probability that other random literals containing x are satisfied, depending on 
I, this probability may change considerably. Moreover, for two literals containing x, the two 
events of both being satisfied simultaneously with L are not independent. 

However, an adaption of Scheinennan's argument mentioned above shows that, asymptoti- 
cally, the best choice is to take the point I which is closest to 1/2 as we do in our algorithm. 

Concerning the backtracking part of the algorithm, we would like to point out the difference 
to the approach in [25]. If the (essentially identical) fatality is suffered, a very elegant remedy 
is to simply flip the values of all variables with tentative values: if the tentative value of a 
variable is True, make it False, and vice versa. Needless to say, for variable values in a larger 
set, there is no obvious choice for the new value of a variable. Thus, in our approach, we 
have to choose the variable values in a smart manner, with the single aim to undo the fatality. 
Namely, those variables that led to the fataUty are assigned x{I) as described in Repair Path 
(Algorithm 3). 

2.4. Proof of the Go-Zen lemma. 

Proof of Lemma 1. Assume that Algorithm 2 does not go Zen. 

The only place where a 0-clause can be generated without having gone Zen is in the final 
step 4 of the repair. Algorithm 3. Clearly, none of the clauses on the path will become empty. 
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Moreover, setting the final vaiiable, x,fc, cannot create an empty clause, because of the con- 
ditions in steps (i-9.1) and (i-9.2). 

For a 3-clause to become empty, it is necessary that when the repair is invoked in Algo- 
rithm 2, all thi^ee of its literals have been exposed (possibly in the same iteration). In other 
words, it must have been red, blue, or black in step (i-9.1), a contradiction. 

For a 2-clause to become empty, both literals must have been exposed, one of them possibly 
in the iteration where the repair occurs. Moreover, if it was blue, the value of the variable 
satisfying one of its literals must change during the repair. In other words, the following three 
scenarios are possible: 

( i) it was black before the repair was invoked 

( ii) it was red before the repair was invoked, but it contains xj 

(Hi) it was blue before the repair was invoked, it is of the form Xj € Ij V Xj G Ij for some 
i < j, and Xj is one of the variables set in step (r-2) of Algorithm 3. 

In case (i), if the black 2-clause becomes an empty clause, either it was red when its final 
literal was exposed, a contradiction, or it was blue, which means that at least one of its variables 
lies on the path which is repaired. If the whole clause lies on the path, we have already noted 
that it cannot become empty. If only one of its variables is on the path, then it must be an 
edge in the tree having one end vertex on the path and the other lying further away from the 
root than the path. The fact that it is black means that the variable which is not on the path 
was the current variable of some earlier iteration i < j. But then the conesponding literal was 
either the selected literal Lj, in which case it was satisfied by the tentative value of Xj, or the 
if-condition in step (i-9) for iteration i held, which is a contradiction (either a repair occurred, 
or the algorithm went Zen). 

In case (ii), if the 2-clause is on the path, it does not become empty. If it is the unique other 
red clause c', then it will be satisfied in the initialization of Phase II. 

Case (iii), is not possible because of the condition in step (i-9.2) □ 

2.5. Random formulas. The following easy facts (see the discussion at the beginning of this 
section) underUes the analysis of the algorithm on random formulas. 

Lemma 3. If Algorithm 1 is invoked with a uar random 3 -iS AT formula, then 

(a) at the beginning of each iteration of the outer loop, the current formula is distributed uar 
conditioned on the number of unused variables, 2-clauses, and 3 -clauses; 

(b) at the beginning of each iteration of the inner loop, the current formula is distributed uar 
conditioned on the number of unused variables, 1 -clauses, 2-clauses, 3 -clauses, and the 
colors of the clauses. 

(c) at the beginning of Phase II in the inner loop, the current formula is not only conditioned on 
the number of unused variables, 1 -clauses, 2-clauses, 3 -clauses, the colors of the clauses, 
and the list L of clauses which are known not to contain xq and the list of clauses in which 
an occurrence of xq has been exposed. 

By Lemma 3, the history of the random process defined by the outer loop, that is, for each 
t, the state of the formula and all other information relevant to how the algorithm will proceed, 
available at the beginning of iteration t, is completely determined by 

J^{t) := {X{t),Y2{t),Y3m (2) 
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in particular it is Markov. 

3. Computations for random intervals 

In this section, we make some computations regarding intervals chosen uar from the subin- 
tervals of [0, 1]. We refer to [38, 32] for background. 

We aim to study the event x{I) € J, with two random intervals / and J (x is defined in (1)). 
We start with the following observation. 

Lemma 4 ([38]). For x E [0, 1] and for a random interval I, we have 

P[x £ I] = 2x{l - x). 

In particular, the probability that a random interval contains the point 1/2 is 1/2. 

The cumulative distribution function of x{I) can be written down. 

Lemma 5. For a random interval I, the random variable x{I) has cumulative distribution 
function 

'0 ift < 

"^^'^-U-H-tr, ift>y. 

.1 ift>l. 
Proof. Direct computation. □ 

Let X be a random variable with cumulative distribution function F as in (3), and define 

P ■.= 1-2X{1- X). (4) 

Thus, by the previous two lemmas, the probability that, for two random intervals / and J we 
have x{I) G J, is 

E(P[;r(/) G J I P]) = E(l - P) = 1 - E P. 
The following computations are straightforward, see A. 1 . 
Lemma 6. 

(a) EP = 13/24 

(b) EP2 = 3/10 □ 
Lemma 7. For two random intervals I, J, the following is true. 

Proof. Immediate from Lemmas 4, 5, and 6(a). □ 
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4. 2-lSAT 

In this section, we take a brief glance at the situation for random 2-iSAT. The reason is that, 
ultimately, our 3-iSAT algorithm reduces the 3-iSAT formula to one with exactly two literals 
per clause, and then invokes the polynomial time algorithm by Chepoi et al. [19] to find a 
solution. We need to make sure that the resulting random 2-iSAT instance is satisfiable. We 
make no attempt to optimize the bound for we obtain. 

For this, we proceed along the same lines as [20], using Chepoi et al.'s Aspvall-Plass-Tarjan- 
type [7] certificate for the non-satisfiability of signed 2-SAT formulas for set systems satisfying 
the Helly-property. We describe the certificate now. 

For a 2-iSAT formula F, define a digraph Gp which contains two vertices labeled xli and 
xl/, respectively, for every literal x G / occurring in F. For every clause x G I V x' G /' 
of F, the digraph Gp contains two arcs xl/ x'l't and x'l'/ — xlt. We refer to these 
arcs as clause arcs. Moreover, for every two literals x G I and x G J occurring in F, if 
I n J = 0, the digraph Gp contains the two arcs xlt — xJ/ and xJt — > xl/. These arcs we 
call disjointness arcs. 

For a literal x G / occurring in F, we refer to the vertex xlt as a positive vertex, and to 
xl / as a negative vertex. Moreover, we say that these two vertices are complements of each 
other; in other words, the complement of the (positive) vertex xlt is the (negative) vertex xl/ 
and vice versa. Note that arcs originating from negative vertices are clause arcs, while arcs 
originating from positive vertices are disjointness arcs. 

Chepoi et al. relate the satisfiabiUty of F to the strongly connected components (S'CCs) of 
Gp. 

Proposition 8 (Aspvall-Plass-Tarjan-type certificate, [19]). The formula F is satisfiable if, and 
only if no SCC of Gp contains a pair of vertices which are complements of each other. 

Remark 9. A path in of length £ contains \_l/2\ or [l/2\ disjointness arcs, and no two of 
them are incident. 

Chepoi et al. also give an algorithm which determines, in polynomial time, whether a for- 
mula F is satisfiable, and if it is, produces a satisfying interpretation. We refer to their paper 
for details. 

From Proposition 8, we obtain the following corollary. 
Corollary 10. IfF is not satisfiable, then Gp contains a bicycle, i.e., a directed walk 

■uo • • • ue+i, 
with at least one clause-arc, and the following properties: 

(a) the literals in the vertices ui, . . . ,Ui are all distinct; 

(b) the literals in the vertices uq and Ui^i occur among the literals in the other vertices; 

(c) the clauses in the arcs are all distinct. 

Proof. For a vertex v, we denote its complement by By what we said about the different 
types of arcs, on every path from v to v, there is at least one clause arc. 

Choose an SCC and take a pair of complementing vertices v and v in the SCC such that the 
distance from f to i; in Gi? is minimal. Then, on the shortest path P from v to v, no literal 
appears twice. Denote by L the hteral defining v and v. 
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Now take a shortest path Q in Gp form v to v. If there is no literal other than L which 
appears twice on P U Q, then P U Q is a bicycle starting and ending in v. On the other hand, 
if there is a literal l' other than L which appears twice on P U Q, then the desired bicycle is 
constructed by taking the path P from v to v, and then the path Q until the first vertex whose 
literal already occurred earlier. □ 

Suppose a 2-iSAT formula with n variables and m = cn clauses is drawn uniformly at 
random from the set of all such formulas (with the intervals all in [0, 1]). We estimate the 
asymptotic probability that such a formula is satisfiable. 

Proposition 11. Let d < 3/2. If m < c'n then, whp as n ^ oo, a randomly drawn 2-iSAT 
instance is satisfiable. 

The proof mimics that of Chvatal & Reed [20] for the classical 2-SAT very closely; we 
include it here just to point out where the number 3/2 comes in. 

Proof. Given a fixed bicycle uq —)■••• ^ Ui+i with r clause-arcs, the probability that it occurs 
in Gi? is at most 



m 



P 



where p := 1/3 is the probability that two independently chosen intervals are disjoint [38]. 
Hence, the expected number of bicycles with r clause-arcs occurring in Gi? is at most 



Thus, the expected total number of bicycles is at most 

'-^T.^\2c'pY-\ 

r=l 

The sum is finite if, and only if, 2c'p < 1, i.e., c' < 3/2. Thus, in this case, the probability that 
a bicycle exists is Oc'{l)- □ 

Thus, for every c' < 3/2, whp, a satisfying interpretation can be found by Chepoi et al.'s 
algorithm [19]. 

5. Total population size of our branching system 

As is done in classical SAT, the sub-routine eliminating the unit clauses can be viewed as a 
"discrete time" queue in which customers (i.e., unit clauses) arrive per time unit, the number 
depending on the customer currently serviced, and the single server, corresponding to one run 
of the inner loop of the algorithm, can process at least one customer per time unit. The number 
of iterations of the sub-routine then roughly corresponds to the length of the (first) busy period 
of the server. 

Here, since, we are only interested in the length of the first busy period, the "queue" is really 
a branching system, for which for which we need to know the total number of individuals which 
are born before extinction. Compared to classical SAT, the interval-version poses several small 
challenges which we address in this section. 
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Let a be a non-negative integer, and B{j), j = 0,1,2, ... , random variables taking values in 
the non-negative integers. We say the following sequence of random variables Q{j) a discrete 
queue: 

Q(0) = 
Q{l) = a 

\, ifQ(j)=o 

Q{j)-l + B{j + l) ifQ(i)>0 

The number + 1) is the number of individuals of the branching system after the jth indi- 
vidual has reproduced and died. 

Denote by Z the length of the first busy period of the server, that is, the total population size 
of the branching process: 

Z:=sup{j >0 I Q« >0 Vi = l,...,i} = inf{j > I = 0} - 1. 

A straightforward adaption of the branching-process based textbook arguments for continuous- 
time M/G/1 -queues gives the following (see A.2). 

Lemma 12. Suppose the B{ j), j = 1,2,..., are iid with mean Xb and common probability 
generating function g^. The probability generating function hof Z satisfies 

for every y for which the power series (y) converges and does not vanish. In particular, if 
Xb < 1, we obtain 

EZ = ^. ,5b) 

Moreover, we have 

P[Z >a]< (5c) 

L — ' — ycx—a ^ ' 

for all a > and y > with y > g^ (y). □ 

Remark 13. Since we are only interested in the first busy period, we make the following mod- 
ification to the definition of Q: If Q{j) = but j > 0, then we let Q{j + 1) = (and not 
QU + 1) = a as above). This makes some inequalities less cumbersome to write down. 

5.1. Bounding the tail probability for iid binomial B. Let P be a random variable with val- 
ues in [0, 1]. We say that a random variable B has binomial distribution with random parameter 
P, or Bin(m, P), if 



P[B = k\P = p]= (^^J/(l -p) 

In our setting n is a (lai^ge) integer, and m = m{n) is an integer depending on n. Define 
A = A(n) := ^. Let P be as in (4), and suppose that B is Bin(m, 2P/n). 

Lemma 14. If X{y — 1) < we have 

fe(y)<exp(i|A(y-l) + |A2(y-l)2) 
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Proof. We have e* < 1 + t + for all t <1. For ease of notation, let r := E P = 13/24 and 
T2 := E(P^) = 3/10, by Lemma 6. Since (y — l)X2P < 1 with probability one, the following 
estimate holds: 

m ^ m >. 

sAv) = Ee((T)p'=(i - f r-'^) = E E (T)/(i - ZT-'] 

k=0 ^k=0 ^ 

= E((l + {y- l)^r) < Efe^^s'-i)^^) < E (l + 2(y - 1)AP + 4(y - ifX^P^) 



12' 

as claimed. □ 

Now suppose that P{j), j = 1, 2, . . . , are iid random variables distributed as P defined 
in (4), and that B{j), j = 1, 2, . . . , are iid random variables distributed as Bin(m, 2P(j)/m). 

Lemma 15. For every e > there exist S > and C > 1 such that, if'^/2 < ^\ <\ — e, the 
following is true. 

For all a > Ca, there exists a y with 1 < gg{y) < y < 2 such that 

Proof. For ease of notation, let := y — 1 and r := y|A, so that 1/2 < r < 1 — e. If 
< u < < 1, by Lemma 14, we may estimate 

<j,(y) <exp(i|An + |AV), 

and thus obtain 

P[Z >a]< exp(a(i|An + f A^n^) - {a - a) log(n + 1)). 
Using Lemma 6, we write the exponent as 

aru + ^^ar'^v? — {a — a) \og{u + 1). (*) 

In order to find a u minimizing (*), we take the derivative and solve the resulting quadratic 
equation 

_lg^^2^2 ^ ^ _1|^^2^ u-{l-r)+a/o. = {) (**) 

The value of u which works is the larger one of the two roots: 



2-123 „ 



0{<^/a), {***) 



with an absolute constant in the O(-) (see A.3 for the computation). The numerator is greater 
than zero if, and only if, 4 • jj^r < fijf?' which is equivalent to r < L Thus, there exists a 
C depending only on r, such that > whenever a > Ca. Moreover, by letting u = 
in (**), we see that Ur < < 2, as required. Letting u = Ur in (*), we obtain, for a > Ca, 
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with an absolute constant in the O(-), where 

5,.{u) = ru+ |xilr-\2 _ iog(u + 1) 

(see A.3 for the computation). We have 5r{u^) < 0, because 5^(0) = and since, by the 
choice of Ur, the derivative of 6r in the open interval [0, Ur[ is negative. This also implies that 

y > 9b (y)- Let 

(5* := max[6r{ur) \ '^/^ < r < 1 — e] > 0, 
Finally, increase C, if necessary, to take care of the dependence on 0{^/c) in (***) and (****), 
and define S := —6^/2. This completes the proof of the lemma. □ 

Lemma 16. TfA < (1 - e)^, then 

EZ = -43- (7a) 

and there exist J > and C > 1 depending only on e, such that for all a > Cawe have the 
upper tail inequality 

P[Z > a] < e""^". (7b) 

Proof. Equation (7a) is directly from Lemma 12. 

Lemmas 12 and 15 together imply the tail inequality in the case when y|A > 1/2. For 
smaller values of A, we just note that increasing A increases the length of the first busy period, 
so that the probability for A := 6/13 gives an upper bound for the probability for smaller values 
of A. □ 

5.2. Not-independent binomial. The arrivals at the queue in the context of our algorithm are 
not completely independent. Here we deal with the small amount of dependence. 

We now describe what kind of B{j) we allow. The setting is that n is a (lai'ge) integer, and 
that m = m{n) = 0(n). Let r > 1 and 

z = Zr = Zr{n) := I logn, (8) 

where 6 is as in Lemma 16. Suppose that M (j), N{j) are random variables satisfying 

n — j < N{j) <n + j for all j, (9a) 

< M (j) <m for all j, (9b) 

with probability one, and 

m~ < M{j) < rri^ for all j = 1, . . . , z (9c) 

with probability at least 1 — 0{n~'^'). Let the B{i) be distributed as Bin(M(j), ^^y) for all j. 
More accurately, we assume that there is an iid family of P{j), j = 1, 2, 3, . . . , distributed as P 
above, and an independent family of random variables U{j,i), j = 1, 2, 3, . . . , i = 1, 2, 3, . . . 
each having uniform distribution on [0, 1], and that the joint distribution of the B{j) is the same 
as for the family of sums 

(10) 



- N{j) 
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The P{j) and U{j, i) are assumed to be joindy independent, but we make no assumptions 
about independence regarding the M (j) and N{j) among themselves or from the U{j,i) and 
P{j). However, we do assume that a, the M (j), and the N{j) are such that 

oo 

a + J2B{j) = 0{n) (11) 

holds with probability one. 

Lemma 17. Let = X^{n) := and suppose z > Ca. If 

A+<(l-e)i|, (12) 
then with the 6 and C from Lemma 16, the following holds for large enough n: 

^ "3, -0{n'~l< BZ < '^3 +0(n^-); (13a) 

and for all a > Ca 

P[Z > a] < e"''" + 0(n~''). (13b) 

The proof can be found in the appendix: A.4. 

Remark 18. There is no danger in assuming 6 < 1 and C > 1, and we wiU do that from this 
point on. 



6. The inner loop 

Here we analyze Algorithm 2. Conditioning on X{t), Y2{t), and Y^(t), we analyze the 
changes of the parameters X, Y2, and during the t + 1st run of the inner loop, and bound 
the probability that an empty clause is generated. 

From now on, n and m denote the number of variables and clauses, respectively, in the 
initial random CNF formula, with m = cn for some constant c. We assume c < 10, to get rid 
of some of the letter c in the expressions below. For any e > 0, we say that (x, 2/2, ys) £ is 
e-good, if 

en<x and ^<(l-e) — , (14) 
X 13 

and that M'{t) is e-good if {X{t), Y2{t) ,Y-i{t)) is e-good. 

6.1. Setup of the queues for Phases I and II. We now define the queues for corresponding 
to the Phases I and II. We will suppress the dependency of the random processes on M'{t) in 
the notation. 

We define the queues Qj and Qu for the Phases I and II, respectively, by modifying Algo- 
rithm 2 a little bit. We will then analyze (the original) Algorithm 2 with the help of the queues 
Qi and Qn defined via this modification. The changes we make ai^e the following: replace 
step (i-7) by 

(i-7') If there are unused variables left, choose one uar; 
and step (i-8) by 
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(i-8') Expose all occurrences of the current variable xj in clauses colored with 
a color different from red; 

moreover, in the modification, we do not initiate a repair (since that would kill the queueing 
process). 

Since, with these modifications, red clauses can contain used variables, it is possible to run 
out of variables before running out of clauses. It can be easily verified that this can only happen 
when all clauses are red. Hence, in this situation, the modified algorithm will just eat up the 
red clauses one per iteration. 

In the Phase-I queue Qj, the number of customers arriving in the first time interval, Aj, is 
the number of red clauses generated by setting the free variable xq (tentatively) to 1/2. Thus, 
Aj is distributed as Bm{Y2{t), 5^)- For the iterations j = 1, 2, 3, . . . , we find that Bj{j + 1) 
is the number of uncolored 2-clauses which become red, plus the number of pink 3-clauses 
which become red, when setting the current variable Xj (tentatively) to x{lj). Thus, if we 
denote by (i) '^^e number of uncolored 2-clauses plus the number of pink 3-clauses at the 
beginning of iteration j, then conditioned on Y2{j), the distribution of Bi{j + 1) is that of 
Bin(y2(j), x^{t)-] )' where as in the previous section, the P{j + 1) are iid random variables 
distributed as P defined in (4). If we agree on the convention that a Bin(0,p/o)-variable is 
deterministically 0, this also holds when the queue runs out of variables. 

In the Phase-II queue, the number of customers arriving in the first time interval. An, is the 
number of unit-clauses generated at the end of Phase I by setting the variables to their tentative 
values. The are defined analogous to the 

At this point, note that the condition (11), which is needed for Lemma 17, is satisfied for 
both queues. 

6.2. Bounds for the probabilities of some essential events. Below, we repeatedly use the 
following simple Chernoff-type inequality (e.g. equation (2.11) in [31]): if [/ is a binomially 
distributed random variable with mean fi, then 

P[U >a]< e~" for a > 7^. (15) 

Lemma 19. Let r > 1, 1 < 2 = z{n) = o{n) an integer, (x, 1/2,1/3) £-good for some e > 0, 
and m~ := max(0, 1/2 — rz log n), m"*" := y2 + rz log n. For both phases I and II of the inner 
loop, the following is true. If, at the beginning of the phase at step (i-1 ), there are x variables, 
j/2 2-clauses, and 2/3 3-clauses, then the probability that, while dealing with the first z variables 
in the phase, the number of 2-clauses leaves the interval [m~, m"*"], is 0{n~^). 

Proof. For the upper bound m+, the probability that the number of 2-clauses exceeds m+ 
is bounded from above by the probability that one in a sequence of z independent random 
valuables with Bin(?7i, -distributions is greater than r log re. Here the factor 1/2 on the 
denominator takes care of the z = o{n) variables which are used. For re large enough, this 
probability is at most 

For the lower bound m", the probability can be bounded by the same argument, noting that, if 
m~ = 0, the corresponding probability is 0. □ 
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Let R denote the event that a repair is invoked during this run of Algorithm 2. Moreover, 
denote by Zj and Zjj the length of the first busy period of the Phase I and Phase II queues, 
respectively. Note that they depend on Aj and A//, respectively. Further let Mj and M// be the 
total number of colored clauses which are generated during Phase I and Phase II, respectively; 
let Hj and Hjj the event that, in some iteration, in steps (i-8), the cuiTcnt variable is found to 
be contained in a colored clause (other than the current clause Cj); and by Hf the probability 
that in Phase I the current variable is found to be contained in at least two colored clauses (other 
than the current clause Cj). 

Lemma 20. Suppose that J^{t) is 2e-good. With the 6 := d{e) and C := C{e) from 
Lemma 17, and r > 1, the following is true for all n large enough (depending on e). 



nzn 

V[Mn > 



P[Aj > rlogn 1 


= 0{n-') 


(16a) 


P[Zj > friogn 1 


= 0{n-') 


(16b) 


P[Mi > 5°°^ r log 71 1 J^{t)] 


= 0(n~n 


(16c) 


p[Hj 1 


= o.(^) 


(16d) 






(16e) 


p[R 1 


= OeC^) 


(16f) 


> {r + 1) log n 1 ^(t) & R] 


= 0{n"-) 


(16g) 


> 5°0f'(r + l)logn 1 


= 0{n-') 


(16h) 


250000C^ (r + l)logn | jr{t) k R] 


= Oin'') 


(16i) 


P[Hn 1 ^(t) & R] 




(16j) 



Proof. For (16a), if is 2e-good, then the probability that Ai > logn is bounded from 
above by the probability that a Bin(m, 2^^) -variable is larger than r log 77,, which is at most 
n~^, for 77, large enough, by (15). 

Proof of (16b). We use Lemma 17 together with Lemma 19 to bound the conditional prob- 
ability that Zj > a. If Jif{t) is 2e-good, then the 777+ from Lemma 19, with x := X{t), 
1/2 := Y2{t), 7/3 := Y^{t), and the z = Zr from (8), is such that (12) is satisfied if 77 is large 
enough depending on e. 

The requirement for the estimate in (13b) is that Aj < qq '•= min("/c, ^r/c). Thus, for the 
probabilities conditional on J^{t), we have 

P[Zi > a] 

= P[Zi > a I ^/ < ao] P[^/ < ao] + P[Zi > a | A/ > ao] P[^/ > ao] 

< 0(6--^") + 0(77-^) + P[Ai > ao]. 

With a := ^r\ogn, using (16a) and (15), the right-hand side is 0(77"''). 

Proof of (16c). For every iteration, a clause is only colored if the current variable of the 
iteration is contained in the clause. Hence, the number of clauses colored in the first j iterations 
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is upper bounded by the sum of j independent Bin(m, ^)-variables. Hence, the probability 
that in the first j iterations, the number of colored variables exceeds ja is at most e~" by (15), 
provided that a > > 7 ■ -^^j- Moreover, we have Mj < m with probability one. 

Thus, conditioning on Jif{t) (and keeping in mind that J^(t) is required to be 2e-good), the 
probability that Mj is larger than M^-f log n is at most 

Proofs of (16d) and (16e). In the first phase, in the jth iteration, the probability that the current 
variable Xj occurs in a colored clause (other than the current clause Cj) is Q( x(f)-Z7 )' ^^'^ 
probabiUty that the number of colored clauses containing Xj (other than the current one Cj) is 
twoormoreisO((^^(|^)'). 

By (16b) and (16c), we can bound the probability that this happens in the first Z/ iterations 

by OeC-^) + 0{n~'') and OeC-^) + 0{n^''), respectively, where the constant in the Oe{-) 
depends only on e. 

Proof of (16f). Clearly, the probability that a repair occurs is at most the probability that, in 
some iteration, the current variable Xj occurs in a colored clause (other than the current one 
Cj). Thus, the inequality follow from (16d). 

Proof of (16g). Since Ajj < Mj, this inequality follows from (16c) and (16f), with r replaced 
by r + 1, by conditioning on R: 

P[Mi > ^(r + 1) logn I & R] 

< P[Mi > ^(r + 1) log?! I J^{t)]/P[R I J^{t)] 

Proof of {I6h). We now apply Lemmas 17 and 19 to the Phase-II queue. Let/ := 

If J^{t) is 2e-good, then the m+ from Lemma 19, with x := X{t), y2 := Y2{t), := ¥3(1), 

and the z = Zr' from (8), is such that (12) is satisfied if n is large enough depending on e. 

Again, the requirement for the estimate in (13b) is that Ajj < a'^ := min("/c, ^r'/c). Thus, 
for the probabilities conditional on (i) Sz R, we have 

P[Zjj > a] 

= P[Zn >a\An< ao] F[An < a'^] +F[Zij>a\ Ajj > oq] P[An > a'^] 

< 0(e-^") + 0(n-^'') + P[An > 

With a := ^^^{r+l) log n, we have = log n, so that, by (16g), the probability 

that Ajj > is 0{n^^). In total, we obtain an upper bound of 0{n^^) for the probabihty that 
^//>^(r + l)logn. 

Proof of (16i). For every iteration, a clause is only colored if the current variable of the iteration 
is contained in the clause. Hence, the number of clauses colored in the first j iterations is 
upper bounded by the sum of j independent Bin(m, —^-variables. (The factor of 1/2 in the 



sn/2 

denominator is to take care of the fact that the number of variables, while starting with at least 
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en, might drop below en during the run of Phase I or Phase 11.) Hence, the probabihty that 
in the first j iterations, the number of colored variables exceeds ja is at most e~° by (15), 
provided that a > > 7 ■ J^J- Moreover, we have Mjj < m with probability one. Thus, 
conditioning on J^(t) & R (and keeping in mind that J^{t) is 2e-good), the probability that 



Mjj is larger than 



FX" 



[r + 1) log n is at most 



0(e" 



- (f +1) log n 



)+mP[Zjj> 5-9^{r + l)logn \ je{t)] 

= 0{n~'') + 0{mn 



-500r\ 



0{n 



Proofof (16j). In the second phase, in the jth iteration, the probability that the cunent variable 
xj occurs in a colored clause (other than the current one Cj) is 0{j^jjp-^). By (16h) and (16i), 

□ 



we can bound the probability that this happens in the first Zjj iterations by Os C°^^ " )+0(n 
where the constant in the Oe(-) depends only on e. 



6.3. Changes of the parameters X(t), Y2{t), and Y3{t). We now move to study the differ- 
ences between successive values of these parameters, and we start with X{t + 1) — X{t). 
Denote by Fj and Fjj the number of iterations of the inner loop in the first and second phase, 
respectively. Clearly, X{t) — X{t + 1) = I + Fj + Fjj, where the leading 1 accounts for the 
free variable xq. Moreover, we have Fj < Zj and Fn < Zjj, and the inequality can be strict 
for two reasons: in Phase I, a repair can occur, thus terminating the phase before Qj drops to 
zero; in both phases a red clause can vanish (i.e. become black) in (i-9). However, note that 



Fj = Zi 
FiI[R] > ZiI[R] 
by(16d), (16f)and(16e). 



Let us abbreviate 



AX :-- 



-1 



Y2it) 



1 



i3y2(t) 

12X{t) 



-1 



with probability 1 
with probability 1 

UY2{t) 



Oe 



' log 



n 

I log"' 1 



and 



(17) 



VlX(t)-Y2{t) 



VlX(t) - \'SY2{t) llXit) - \'SY2{t) 

Lemma 21. IfJ^(t) is 2e-good and n large enough depending on e, then 

1 - AX - E(Z7 I .^{t)) 

AX - E{X{t + 1) - X{t) I 



and 



\X{t + l) - X{t)\ > log^n 



0{n 



(18a) 
(18b) 

(18c) 



Proof. By what we have said above on the relationship between Fj, Fjj and X{t+1)—X{t), we 
have F/ = Z[ I[R] — Ej and Fu = Zu — En, where Ei and En are error terms accounting 

for red clauses vanishing. We have E(F/ | M'{t)),E{Eii \ J^{t)) = OeC-^) by (16d) 
and (16e) (noting that Ej, Ejj < m). 
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We compute the the mean of Z/ using Lemma 17 with the from Lemma 19 with 
z := ^ log n as in (8). Thus, letting logn (the bound from Lemma 19), for each 

a, conditional on Aj and J^{t), we have 



12X{t)+z 



13Y2it)+v ' 
12X{t)-z 



SO that 



Ai 



1 



i3y2(t) 

12X(i) 



+ 0e{ 



Ai log n 



provided that Ai < ^/c, which holds with probability at least 1 — 0(n ^) by (16a) by increas- 
ing, if necessary, r beyond 26C. Since Zj = 0{n) with probability one, we obtain 



E(Z/ I J^{t)) = E 



Ai 



i3y2(f) 

l2X{t) 



V2(t) 



_ E(^j I ^(t)) . log^n x _ ^ , ^ ^ login s, 

, _ ISVaW "^^^'^ n. ^ -. 13^2 (f) "^^'^'^ " 
^ 12X(t) 12X(t) 

which proves (18a). For F/, we obtain 

E(F/ I J^{t)) = B{Zi I J^{t)) - B{ZiI{R) I Ji^{t)) - E{Ei \ J^{t)) 

= AX + OeC-^) - OSogn) P(ii I - mO{n~^) - 0,(^) 

and 

E(F// I ^(t) & i?) < E(Z//) = ©^(logn) + 0{n-'')m, 

from which (18b) follows. 

Since X{t) - X{t + I) < 1 + F/ + F//, the tail inequahty (18b) follows immediately 
from (16b) and (16h). □ 

Lemma 22. IfJ^{t) is 2e-good, then 



-AX- 



X{t) 



E(y3(i + l)-l3(i) I^W) 



and 



\Y^{t + l)-Y^{t)\ >\og^n 



0{n 



-10\ 



(19a) 



(19b) 



Proof. Let us denote by X'{j) the number of unused variables after j iterations of the inner 
loop, i.e., before xj is used, for j = 0, 1, 2, .... In every iteration of the inner loop, regardless 
of whether in Phase I or Phase II, for every uncolored 3-clause C, there is a probability 
that the current variable xj is found to be contained in C in step (i-6. 10), or (i-3.3), respectively, 
for the zeroth iteration in Phase I. If that is the case, the 3-clause is colored, and when the inner 
loop terminates, the clause wiU no longer be a 3-clause. 
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If we suppose that, at the beginning of iteration j = 0, 1, 2, . . . , before the current variable 
Kj is treated, there are Y^{j) uncolored 3-clauses and X'{j) unused variables, then the number 
of 3-clauses which ai^e hit Xj is distributed as Bin(y3 (j), 3/^'(i))- (We have X'{j) = X{t) — j 
in Phase I, but in Phase II the value of course depends on how Phase I went.) 

For (19b), we can just use the fact that the number of 3-clauses which are colored is bounded 
from above by Mj + Mjj, the total number of colored clauses. Thus, by (16c) and (16i), this 
number is at most log^ n with probability 1 — 0{n~^'^) for n lai^ge enough depending on e. 

For the conditional expectation estimate (19a), we compute, conditional on J^{t), 

B{Ys{t + 1) - Y3{t)) = E{{Y3{t + 1) - Y^it)) I[R]) + B{{Y3{t + 1) - Ys{t)) I[R]). 
For the left summand, we have 

E{{Y3{t + l)-Ys{t))I[R]) 

< E(log2 nl[i? kYs{t + l)- Ysit) < log^ n]) 

+ B{mI[RkY3{t + l) -Ysit) >log^n]) 



< log^n P[R] + mP[Y3{t + 1) - Ysit) > log^ n] = log' n OeC^^) + 0( 



n 



' log* n ^ 



by(16f)and(19b). 

For the right summand, we have 



Zi + l 

log^ 71 ^ 



E{{Y,{t + 1) - Ys{t)) m) = e( £ GU) m) + Oe{ 

where, conditioned on 1^3 (j) as defined above, the G(j+1) are distributed as Bin(y3'(j), x<^)-j ) 
and the O(-) accounts for the possibility that Fj < Zj, cf. (17). Using (15) and a similar argu- 
ment as above, we see that 

E G{j)m]=Ei G{j)] +Oe{^). 

Computing the expectation of the sum can be done in the same way as for classical SAT 
(e.g. in [1, 4, 2]). Indeed, using the optional stopping theorem (Z/ + 1 is a stopping time for 
the histoiy of the queue together with all random processes involved; cf. the proof of the next 
lemma for the details, where the situation is essentially the same, only a bit more complicated), 
we find that 

^Zi+l s . Zi 



j = l / ^J=0 

where we agree that 0/0 = 0. By (19b), Y^{t) - log^n < Yl{j) < Ys{t) with probability 
1 — 0{n~^), and by (16b) we have Zj < log^ n, implying X{t) —j> ^X{t), with probabiUty 
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1 — 0(n ^^). Thus, we conclude 

Zi 



EE 



3=0 



m-j 



E 



{i\m)-^n<YHj) t A-(t)-,>i^(t)]^E(^+o(Mip!^ 



log^ n 



n 



by (18a). This concludes the proof of (19a). 
Lemma 23. If M'{t) is 2e-good, then 



□ 



' log n N 



(20a) 



and 



\Y2{t+l)-Y2{t)\ >\og^n 



0(n"i°) 
(20b) 



Proof. The tail inequality is obtained by referring to (16c) and (16i) again, since very clause 
which changes its length has been colored before that can happen. 

Let us denote by X'{j) the number of unused variables after j iterations of the inner loop, 
i.e., before Xj is selected. In every iteration of the inner loop, regardless of whether in Phase I 
or Phase II, for every uncolored 2-clause C, there is a xtq) probability that the cuixent variable 
Xj is found to be contained in C in step (i-6.10), or (i-3.3), respectively, for the zeroth iteration 
in Phase I. If that is the case, the 2-clause is colored, and when the inner loop terminates, the 
clause will no longer be a 2-clause. The same is true for 3-clauses which have become red in 
some previous iteration. Denote the total number of 2-clauses and pink 3-clauses which are hit 
by the current variable in some iteration over the whole run of Algorithm 2 by L2x ■ 

The analysis of the expectation and tail of L2x is almost identical to the analysis done in the 
previous lemma for the 3-clauses. Here, too, we have to condition on the number of uncolored 
2-clauses and pink 3-clauses not changing too much. The difference is the need to control the 
number of pink 3-clauses and, after a repair, the number of 3-clauses becoming 2-clauses. The 
latter two numbers are bounded from above by ¥^{1 + 1) — Y^{t), which is at most log^ n with 
probability 1 — ©^(n"^*^). Thus, for L2x, we just note that its expectation accounts for the 
summand —AX^^^ in (20a). 

Now let us denote the number of 3-clauses which become 2-clauses during the two phases 
of the inner loop by Ls^2, and let us also focus on the case when no repair occurs. 

In this case L-^^2 behaves similarly to Y^lt + 1) — Y^{t), with two differences: The prob- 
abihties that a 3 -clause is colored pink is different; and the zeroth iteration differs from the 
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Others. Let us first consider the zeroth iteration. The probability that the tentative value 1/2 of 
xq makes a 3-clause pink is 1/2 by Lemma 4. Thus, if there is no repair, this contribution is 
distributed as Bin(y3(t), i • j^)- 

For the other iterations, j = 1, 2, 3, . . . , if an uncolored 3-clause C contains the current vari- 
able xj, the probability that C becomes pink in (i-13) depends on the current interval Ij, and 
is distributed as P defined in (4). Indeed, if we denote the number of uncolored 3-clauses in 
iteration j by Y^{j) again, then, conditioned on Y^{j) and X'{j), the number G{j + 1) of un- 
colored 3-clauses which become pink in iteration j is distributed as 6111(1^3(7), ^^x''(t)^ )' 
binomial with random parameter P{j + I). The P{j) are the iid random variables distributed 
as P in (4) defined by x{lj), in other words P{j + 1) = 1 — 2x(lj)(l — x{lj)). 

Let G(l) be distributed as Bin(y3(t), ^^), define D{j + 1) := G{j + 1) - ^^^jfi, where 
we agree that 0/0 = 0, and denote by ^(j) the history of the process up to iteration j, i.e., 
before the variable xj is treated. Then Yl^j=i ^U)' ^ — 1, 2, 3, . . . , is a martingale with respect 
to ^(j), j = 0, 1, 2, . . . , and Z/ + 1 is a stopping time, because deciding whether Z/ + 1 < £ 
amounts to checking whether Qi (i) = 0. 

To estimate the expectation of the contribution of these, we use the optional stopping the- 
orem again; note that the stopping time is finite with probability one, because Zj < m. We 

conclude that E (j2jLV ^U)) =0' which means 



8X'{j) 



V i=i / \j=o 



Arguing as we have done a number of times in regard of the possible deviations of Y'{j) from 
Y{t), we see that the right hand side equals 

Getting rid of the conditioning on the event that no repair occurs is done in the same was as 
in the previous lemma, and we leave the details to the reader. □ 

6.4. Failure probability. We now bound the probability that an empty clause is generated by 
a run of the inner loop, including, possibly, the repair and following second phase. 

Lemma 24. If r^{t) is 2e-good, then the probability that Algorithm 2 produces an empty 
clause, is o{^/n). 

Proof. We use Lemma 1. Let us first deal with Phase IL The probability that the algorithm 

goes Zen in Phase II is Og ( " ) by (16j), conditioned on a repair occurring, so that by the 
law of total probability, the probability that the algorithm goes Zen in in Phase II is at most 

0,(i2I^),by(16f). 

For Phase I, we need to go through the possible reasons for the algorithm to go Zen. First of 
all, by (16e), the probability that the current variable Xj is contained in a colored (red or not) 

clause other than the current one C, is 0,(1^), which takes care of step (i-6.3). 

The probability that a fixed clause contains the cuiTcnt variable of a fixed iteration depends 
only on the number of variables and the number of unexposed atoms in the clause, and so it 
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can always be bounded by In order for a 3-clause to become red or blue (or even black), it 
must contain the cun^ent variable of (at least) two iterations. The probability of this happening 
is Oe(^^p^)> where we have used (16b). This gives the case of step (i-9.1). 

Similarly, for step (i-9.2), a 2-clause must have been hit twice by the current variable of an 
iteration, the probability of which is again bounded by Oe(^^^). 

In total, the failure probability can be bounded by O ( 22k}^ ) □ 



7. The outer loop 

At the heart of analysis of the outer loop is the well-known theorem of Wormald's which, in 
certain situations, allows to estimate parameters of random processes by solutions to differen- 
tial equations. Here is the first goal of our analysis. 

Lemma 25. For every c € ]0, 3], the initial value problem 

dy _ -18cx^ + 2y{l2x - y) 

~i — — 7 \ (^Zia.j 

ax x(\2x — y) 

y{l) = (21b) 
has a unique solution y defined on the interval ]0, 1]. 

See Fig 1 for a rough sketch of the direction field (21a) with c = 2, and a solution to the 
IVP. Since, ultimately, we will solve the IVP (21) numerically for the right value of c anyway, 
strictly speaking, this lemma is not needed to complete our argument. However, we would like 
to reduce our reliance on numerical computations as much as possible. 



Proof of Lemma 25. To use the known theorems on IVPs, note that the right hand side of (21a), 
seen as a function of (x, y), is continuously differentiable on {(x, y) | x > 0, y < 12x}. 
We make the following claims: 

(a) For < X < 1, the solution to the IVP never crosses the line y = 5(1 — x); 

(b) for < X < 4/5, the solution to the IVP never crosses the line y = 6x. 

Thus, the solution to the IVP does not approach the y = 12x, which implies that the solution 
extends to the whole interval ]0, 1]. 

Let g{x, y) := ^^^Z\iS-yf~^^ ' ^^^^ ^^^'^ ^^'^^ of prove claim (b), 

it suffices to show that, with y{x) := 6x, whenever < x < Ys, we have ^ < g{x,y{x)). 
The computation is easy but tedious and can be found in the appendix, see A.5. Similarly, for 
claim (a), with y{x) := 5(1 — x), for every 4/5 < x < 1, we have ^ < g{x,y{x)). The 
computation is in the appendix, too. □ 

Lemma 26. Let c < 3 and y a solution to (21), and let xq be the infimum over all x > 3efor 
which 



I3y{x) < (1 - 3£)12x 



(22) 
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Figure 1. Direction field and solution for IVP (21) 



holds. Then there exists a r > and a strictly decreasing smooth function x: [0, r] 
a;(0) = 1 and x(r) = xq, such that whpfor all t with */n < t: 



R with 



X{t) =nx{t/n) + o(n) 
Y2it) =ny{x{t/n)) + o{n) 
Ysit) =ncx{t/nf + o{n). 



(23a) 
(23b) 
(23c) 



Moreover, we have the relationship 



dx 
lit 



y{x) 



X - ^y{x) 



12 X — y{x) 
12 x - 13y(x) 



(23d) 
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Proof. For the proof we use Wormald's well-known theorem, which requires some set up and 
computations. Using the notation of Theorem 5.1 in [40], let 



D 




Co 


:= 10 




:= log^ n 


7 


:= 3n-2 


Ai 


log^ n 
n 


A 


\og^^ n 

- ^1/3 ' 



Note that A > Ai + Con7, and A = o(l), as required in Theorem 5.1 in [40]. 
Obviously, we have Q < X,Y2,Y^ < Cqu. 

(i) Equations 18c, 20b, and 19b, respectively, show that, if {t,X{t)/n,Y2{t)/n,Y^{t)/n) € 
D, then, conditioned on M'{t), the probabihty that X{t + 1) - X{t) < p, Y2{t + 1) - 
y2{t) < f3, and Ys{t + 1) - Y3{t) < (3 hold, is at least 1 - 7. 

(ii) The first parts of Lemmas 21, 23, and 22, respectively, show that, if {t,x,y2,y3) '■= 
{t,X{t)/n,Y2{t)/n,Y3{t)/n) e D, 



fit, X, y2, 2/3) - E(X(t + 1) - X{t) I je{t)) 
52(t, X, 2/2,2/3) - E(F2(i + 1) - Y2{t) I je{t)) 
93{t, X, 2/2,2/3) - E(F3(t + 1) - Y^it) I je{t)) 



< Ai 

< Ai 

< Ai, 



where 

122/2 



f{t,x,y2,y3) ■■= -1 



12x - 132/2 



/, N 32/3(t) 132/3(0 , N22/2(t) 

92{t,x,y2,y3) ■■= + (1 - f{t,x,y2,y3)) ^^^^^ + f{t,x,y2,y3)—^^ 

32/3 (i) 



93{t,x,y2,y3) := f{t,x,y2,y3 



xit) 



(iii) There exists an L depending on e such that /, (72, 2/3 are L-Upschitz continuous on D. 
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Let j;, y2, 2/3 be the solution to the initial value problem 

dx 

— = f{t,x{t),y2{t),y3{t)) (24a) 

^=g^{t,x{t),y2{t),ys{t)) (24b) 
at 

^ = g^{t,xit),y2{t),y3it)) (24c) 

x{0) = l y2(0) = y3(0) = c. (24d) 

From Wormald's theorem, we conclude that with probability 

l-0(n7|e-"(V^)^)=l-0(i), 

it is true that, for alH = 0, . . . , an, we have X{t) = nx{t/n) + 0(An), Y2{t) = ny2{t/n) + 
0(An), and Y'i{t) = ny3{t/n) + 0{X/n), where cr = a{n) is the supremum over all s for 
which the solution to (24) can be extended before reaching within a distance of CX from the 
boundary of Z), for a large constant C. 

We now need to study the initial value problem (24). Let us start with the first equation (24a), 
which we write as 

dx 12 x — 2/2 

It ~ ~12x - 13y2' 

dt = ^p^dx=(l--^)dx, (25) 
12 a;- 2/2 ^ I2x-y2^ 

dys ^ dxSys 
dt dt x ^ 



which amounts to 

The third inequality 
is equivalent to 



which immediately integrates to^ 



dys ^ 32/3 
dx X 



2/3 = cx^, 



where the constant before the x^ is derived from the initial value conditions 2/3(0) = c and 
x(0) = 1. Finally, we write the second equation as 

dm _ _m_ _ dx_ 132/3 _^ dx 22^ _ _ ys _ Hcx^^ + ^2/2 dx 
dt Sx dt 8x dt X 8x 8 dt X dt 
from which we obtain 

— = --x^— - —cx^ + — 
dx 8 dx 8 X ' 

which, by (25), yields 

dy2 c 2l2x- 132/2 13 2 22/2 -IScx^ + 22/2 (12x - 2/2) 
— — = -X CX H = . 

dx 8 12x — 2/2 8 X x(12x — 2/2) 



-1 

It should be noted that this is the same relationship between x and y3 as in the case of classical 3-SAT (see [2]). 
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which is an ODE of the function 7/2 in the variable x. In fact, with 2/2(1) = 0, we recognize the 
IVP (21), and thus y = ?/2 in the interval on which both are defined. 

To summarize, we have 1/3 = cx^, and 7/2 = y as a function of x is a solution to the IVP (21), 
and X as a function of t solves the ODE (24a) with boundary condition x(0) = 1. 

From Lemma 25, we know that the solution ?/ to (21) can be extended to a solution of the 
IVP define on the full interval ]0, 1]. Moreover, ^ < whenever 13y(x) < 12x, so the 
derivative of x is strictly negative provided that x > xq. This implies that the solutions x, 
1/2, Us to (24) can be extended to the interval [0, r], where r is the unique number satisfying 
x{t) = xo; in particular we have a < t. 

This completes the proof of the lemma. □ 

We are now ready to prove Theorem 2. 

Proof of Theorem 2. Lemma 26 gives the behavior of the parameters X{t), Y2{t), and l3(t) 
up to an error with high probability for all t = 0, . . . , r??,. We need to check that 

(a) the algorithm terminates before t grows beyond rn, 

(b) in this region of t, whp, the algorithm does not produce an empty clause. 

For (a), we solve the IVP (21) numerically for c = 2.3. The solution is drawn in Fig. 2. The 
figure also shows the line 12y = 13x. For this value of c, we see that there is an e > such 
that the solution y{x) to the IVP (21) satisfies 12y{x) < 13(1 + 2e)x for all x > 2e; w.l.o.g., 
we may assume that e < 1/9. Consequently, the xq from Lemma 26 equals 3e. Algorithm 1 
terminates as soon as Y2{t) + l3(t) < c' X{t). Thus, by Lemma 26, we have an s > r such 
that x{s) = 1/3 > xq, and that, if we let c' := ||, whp, for this t := [sn] 

Y2{t) + Ysit) = ?iy(V3) + nciysf + o(?i) 

< n((l - + i) + o{n) < = ^ ■ < c'X{t) - o(n), 

if n is large enough. Thus, the algorithm terminates before the parameters X{-), Y2{-), Y^{-) 
fail to be 2e-good. 

It follows that Lemma 24 gives a failure probabihty of o(Vn) per iteration, so that the total 
failure probability is o(l). This proves (b) and completes the proof of Theorem 2. □ 

8. Conclusion and outlook 

We have given an algorithm for fc-iSAT, for A; = 3, which succeeds with high probability on 
instances for which "^/n < 2.3. Though it is conceptually easy to extend to general k the algo- 
rithm and the analysis up to the point where the initial value problem has to be solved, the need 
to numerically solve a system of A; — 2 ordinary differential equations makes it seem improba- 
ble that a general formula for the maximal ratio can be derived. Solving the system for small 
values of k, we obtain the results shown in Table 2 (we always rounded down generously). 



k 3 4 5 6 7 8 

max. ™/n 2.3 3.75 6.25 10.5 18.5 32.5 



Table 2. Performance for different values of k 
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Figure 2. Solution of IVP (21) with bounding curves 



A couple of issues remain. Firstly, a bound for the ratio above which random 3-iSAT for- 
mulas are wpp/whp not satisfiable might be interesting. 

Secondly, there might be a threshold behavior for random 2-iSAT, similar to the situation for 
classical 2-SAT [20, 27]. In fact, we conjecture that there is a threshold at c = (the value 
from Proposition 11). For this it remains to prove that for c > 3/2, a random 2-iSAT formula 
with "i/n = c is whp not satisfiable. 

Finally, we would like point out that several papers have raised the question of a threshold 
for random regular 3-iSAT [10, 12, 15, 37]. The situation there is likely to be easier than for 
iSAT. 

We would like to close by thanking various unnamed people for their silent encouragement. 
The second author expresses his gratitude for the superb working environment at the Casa del 
Gelato. 
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Appendix A. Deferred proofs 



A.l. Computations for Lemma 6. For (a), we compute 




/ 



[0,V2[ 



2t{l 



t)dtF{t)dt + 2t{l-t) 



1=1/2 




L 



2t(l - t)dtF{t) dt 




2t{l - t)2t dt + - + / 2t{l 
4 Jih 
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For (b), we compute 

E{X^{1 - Xf) = J - t)2 dF{t) 



t\l-tf -1+ f f{l-tfdtF{t)dt+ [ t^(l-tfdtF{t) 
= 4+/ t^{l-tf2tdt+ t^{l-tf2{l-t)dt 

J[0l/2[ J] 1/2,1] 

1-1/2 

= ^ + 4/ t^{l-tfdt 
Jo 



t=o 



1/2 

t=0 ^ 



1/2 

t=0 



— 1 _|_4('i 1 _lX 1 4_J_ 1 ^ — J__L 1 (^-\_l_l_J_\ — 1 I 22 _ 1 I 11 _ 15+11 _ 13 

— 25'"'"^U 2^"'"10 2^"'~60 2^'' 25"'"2^V-^"'"5"'"15/ ~ 25~'~T5^ 25~'~15-25 15-2^ Ts^" 

Hence, using (a), we obtain 

E(P2) = 1 - 2(1 - EP) + 4EX\1 - X)2 = 1 - 11 + 13 = 18 = ^. 

A.2. Proof of Lemma 12. The proof is taken almost word for word from Grimmett & Stirza- 
ker [28], Theorem 11.3.17, with some changes due to the discrete aixival- and servicing points. 

We say that the sons of a customer Paul are those customers arriving in the time interval in 
which Paul is sei^viced. Paul's /am//)' consists of himself and all of his descendants. 

Fix a time intei-val j in which the queue is not empty and denote by X the size of the family 
of the customer served at that time interval. We have the relation 

B(j+l) 

x = i+ Yl 

i=l 

where Xi denotes the family size of the i'th customer arriving in the time interval j. 

The important observation now is that the family sizes are iid because the B{j) are iid, and 
that the Xi are independent of B{j + 1). Consequently, for the common probability generating 
function y of X and the Xi, we have 

y{x) = X gg{y{x)). (*) 

The length fist busy period coincides with sum of the family sizes of the a customers arriving 
in the first time interval. Thus, we obtain 

h{x) = y{xY. (**) 

Solving (*) for x and inserting into (**), we obtain 

M^) = y(^r- (***) 

If y(0) = 0, then i? = 0, and thus h{y) = y"-, which coincides with equation (5a). Otherwise, 
by (***), equation (5a) holds for all y in the interval [y(0), 7/(1)], and thus for all y for which 
the power series on both sides of the equality sign converge. 

We derive the statement about the mean length of the first busy period by differentiating (5a), 
and possibly invoking Abel's Theorem to evaluate the power series at the point 1. 
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Finally, the statement about the tail probability follows directly from the standard exponen- 
tial moment argument: If y > gg{y) > 0, then, with x := y/gg{y) > 1, we have 

P[Z >a]= P[x^ > x"] < = — - ^ - ^''^^^ 



as claimed. 

A.3. Computations for Lemma 14. Computations regarding equation (**): 

Oir + -r^^ar u — (a — a) = U 

13^ ^ u + l 

ar{u + 1) + -^^ar'^u{u + 1) — (a — a) = 
(^^^ar^^u^ + (^ar + -^^ar'^^u — ((1 — r)a — a) = 



ar + iP^ar^] ± (ar + ^^^ar^j + 4((1 - r)a - a) (^^^ar^ 

2 • 



We need to be close to 0, so we take the "±" 



9 . If 2 



13^ 



12^ 
13^' 



1 + ifeO + V (l + liiO' - + 4(1 - 



12^ 



l + Tf3-)+\/ il-Tg50'+4(l-aA)J^ 



0(V"), 



with an absolute constant in the O(-), because a < a and 1/2 < r < 1. 



34 



KATHRIN BALLERSTEIN AND DIRK OLIVER THEIS 



Computation regarding equation (****): 

l*)]{ur) aru + ^^:^ar'^u^ -{a- a) log{u + l] 



a a 

= rur + ^r\l - (1 - a/a) log(n^ + 1) 

= rUr + ^r'^ul - ^Og{Ur + 1) + 0(ya), 

with an absolute constant in the O(-), because Uj. + 1 < 2. 

A.4. Proof of Lemma 17. Suppose that the B{j) are represented as a sum as in (10) above, 
and define 

i=i 

Then the B'^{j), j = 1, 2, 3, . . . , are iid, so that Lemma 16 is applicable. The same is true for 
the B~{j), j = 1, 2, 3, . . . . We clearly have, with probability 1 — 0(n~^), 

B-{3) < B{j) < B+{j) for all j = 1, . . . , z. 

Defining two queues Q^{i) based on the B^{j) and respective lengths of first busy periods 
Z^, we obtain, with probability 1 — 0{n~''') 

z- <Z < Z+, (*) 

where we have also used that Z^ < z with probability 1 — 0{n~^) (Lemma 16). 

Denote by E the event that (*) holds. If (*) does not hold, we still have Z = 0{n) by (11), 
so that we obtain 

EZ = E{Z I E) P{E)+E{Z I E) P(E) < E(Z+ | E) P(S)+0(?i^"'") < E{Z+)+0{n^-'') 

For the lower bound, we similarly have 

EZ> E{I{E)Z-) = E(Z-) - E{I(E)Z~) 

Clearly, E{I(E)Z~) <zP(E) + E{I(E)Z- 1[Z- > z]) = zP(E) + mO{n~'") = 0{n^-'') 
Thus we conclude that E Z > E - 0{n}-''). 
For the tail estimate, we use Z~^ : 

P[Z >a] <P[Z > a &i Z < Z+]+P[Z > a &i Z > Z-^] 

< P[Z+ >a]+ P[Z > Z+] < e-^° + 0(n-'') 

by Lemma 16. 



A.5. Computations for Lemma 25. 
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For the proof of Claim (b). Let g{x,y) := ^^^^x{i2x-^y)^ ' right hand side of the 

ODE (21a). As mentioned in the proof of the lemma, we show g{x,y{x)) > 6 = for 
< X < 4/5. We compute 

-18cx^ + 2 • 6x(12x - 6x) _ -IScx^ + 2 • 6(12 - 6) _ -IScx^ + 72 
^^""'^^^^^ " x(12x - 6x) " (T2^^6) " 6 

9-16 25- 12 - 9- 16 



-3cx^ + 12 > -9x^ + 12 > -9(4/5)^ + 12 = 12 



c<3 25 25 

12(25 -3-4) 12-13 



> 6. 



25 25 

For the proof of Claim (a). Let g{x, y) as above. As mentioned in the proof of the lemma, we 



show g{x, y{x)) > -5 = ^, for 4/5 < x < 1. To show that 



, -IScx^ + 2 • 5(1 - x)(12x - 5(1 - x)) 

g{x,y{x)) = — ^ ^-^ ^ > -5, 

x(12x — 5(1 — xj) 

we compute 

- 18cx^ + 2 • 5(1 - x)(12x - 5(1 - x)) + 5x(12x - 5(1 - x)) 
= -18cx^ + 10(1 - x)(17x - 5) + 5x(17x - 5) = -18cx^ + (10 - 5x)(17x - 5) 
= -IScx^ - 85x2 _^ -^g^^ - 50 > -54x'^ - BSx^ + 195x - 50. 

c<3 

The derivative — 216x^ — 170x + 195 of the last polynomial is strictly decreasing, and evalu- 
tating it at 4/5 gives -216(4/5)^ - 170 • 4/5 + 195 ^ -51.592 < 0. Thus, it suffices to check 
the inequality -54x4 _ 35^2 _^ ^95^ - 50 > for x = 1: -54 - 85 + 195 - 50 = 6 > 0. 
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